[Backend Tester] Seed based on test name #13313

GregoryComer · 2025-08-12T06:07:30Z

Set a manual seed for PyTorch based on the test base name (the test case name, not including the flow and similar suffixes). This makes test results stable between runs and between backends/flows, which is useful for comparing accuracy between backends, for example.

I validated this change by running the convolution tests for XNNPACK twice and confirming that the output accuracy statistics were identical.
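
As a rough illustration of the idea described above, here is a minimal sketch of deriving a stable seed from a test base name. The body of the PR's `_get_test_seed` is not shown in this conversation, so the helper below (`get_test_seed`), the SHA-256 choice, and the `sample_inputs` stand-in are all assumptions made for illustration. The sketch seeds the standard-library generator to stay self-contained; the tester itself would presumably pass the value to `torch.manual_seed`.

```python
import hashlib
import random


def get_test_seed(test_base_name: str) -> int:
    """Hypothetical helper: map a test base name to a stable 32-bit seed."""
    # Python's built-in hash() is salted per process for strings, so it
    # would not give the same value between runs. A cryptographic digest
    # of the name does.
    digest = hashlib.sha256(test_base_name.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "little")


def sample_inputs(test_base_name: str, count: int = 3) -> list[float]:
    """Stand-in for input generation, seeded from the test base name."""
    # Assumption: the real tester seeds PyTorch instead of the stdlib RNG.
    rng = random.Random(get_test_seed(test_base_name))
    return [rng.random() for _ in range(count)]
```

Because the seed depends only on the base name, two flows or backends running the same test case would draw the same random inputs, which is what makes their accuracy statistics comparable.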

[ghstack-poisoned]

GregoryComer · 2025-08-12T06:07:31Z

Stack from ghstack (oldest at bottom):

pytorch-bot · 2025-08-12T06:07:34Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13313

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 6 New Failures, 3 Unrelated Failures

As of commit e7b7975 with merge base d7ecd87:

NEW FAILURES - The following jobs have failed:

Build documentation / build (buck2) / Build doc (gh)
At least one of the pre-conditions you specified did not hold
pull / unittest / linux / linux-job (gh)
backends/test/suite/tests/test_reporting.py::Reporting::test_csv_report_simple
pull / unittest / macos / macos-job (gh)
backends/test/suite/tests/test_reporting.py::Reporting::test_csv_report_simple
pull / unittest-arm-backend-with-no-fvp (test_pytest_ops) / linux-job (gh)
RuntimeError: Command docker exec -t 2db693d225cd5a3612f384afb2f58cbc10fbdfcab08e44e50c7cfc7b1eae1ade /exec failed with exit code 1
pull / unittest-editable / linux / linux-job (gh)
backends/test/suite/tests/test_reporting.py::Reporting::test_csv_report_simple
pull / unittest-editable / macos / macos-job (gh)
backends/test/suite/tests/test_reporting.py::Reporting::test_csv_report_simple

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

pull / test-eval_llama-wikitext-linux / linux-job (gh) (detected as infra flaky with no log or failing log classifier)
pull / test-llama_runner_eager-linux / linux-job (gh) (detected as infra flaky with no log or failing log classifier)

BROKEN TRUNK - The following job failed but was also failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest-arm-backend-with-no-fvp (test_pytest_models) / linux-job (gh) (trunk failure)
backends/arm/test/models/stable_diffusion/test_vae_AutoencoderKL.py::TestAutoencoderKL::test_AutoencoderKL_tosa_MI

This comment was automatically generated by Dr. CI and updates every 15 minutes.


digantdesai · 2025-08-12T11:17:09Z

backends/test/suite/runner.py

@@ -40,6 +41,16 @@
 }


+def _get_test_seed(test_base_name: str) -> int:


Why not set a new, global seed every run? And print it somewhere to reproduce. Hardcoding seed ==> we will test with same random numbers every time, not sure if that's what we want.

GregoryComer · 2025-08-13

Updated to generate a random run-wide seed and allow specifying a seed from the CLI.
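
The approach in this reply could look roughly like the sketch below: pick a fresh random seed for each run unless one is supplied on the command line. The flag name `--seed` and the function `resolve_run_seed` are assumptions for illustration, not the names used in the PR.

```python
import argparse
import random


def resolve_run_seed(argv: list[str]) -> int:
    """Return the seed given on the command line, or a fresh random one."""
    parser = argparse.ArgumentParser()
    # "--seed" is an assumed flag name; the PR's actual CLI is not shown here.
    parser.add_argument("--seed", type=int, default=None)
    args = parser.parse_args(argv)

    if args.seed is not None:
        # An explicit seed lets a previous run be reproduced.
        return args.seed

    # Otherwise draw a new 32-bit seed from the OS entropy source.
    return random.SystemRandom().randrange(2**32)
```

A runner would call something like `resolve_run_seed(sys.argv[1:])` once at startup and print the result, so that a failing run can be repeated by passing the printed value back in.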


Update

733d4f9

[ghstack-poisoned]

meta-cla bot added the CLA Signed label Aug 12, 2025

GregoryComer added a commit that referenced this pull request Aug 12, 2025

[Backend Tester] Seed based on test name

3577326

ghstack-source-id: 10756ac ghstack-comment-id: 3177836622 Pull-Request: #13313

GregoryComer requested a review from digantdesai August 12, 2025 06:12

GregoryComer marked this pull request as ready for review August 12, 2025 06:12

GregoryComer requested a review from cccclai as a code owner August 12, 2025 06:12

Update

8f476ea

[ghstack-poisoned]

GregoryComer added a commit that referenced this pull request Aug 12, 2025

[Backend Tester] Seed based on test name

f380695

ghstack-source-id: fc23afa ghstack-comment-id: 3177836622 Pull-Request: #13313

Update

a62c6d0

[ghstack-poisoned]

GregoryComer added a commit that referenced this pull request Aug 12, 2025

[Backend Tester] Seed based on test name

71f49c2

ghstack-source-id: 6f79864 ghstack-comment-id: 3177836622 Pull-Request: #13313

GregoryComer mentioned this pull request Aug 12, 2025

[Delegate Testing] Add report generation for key AoT metrics for P0 delegates #12897

Open

digantdesai reviewed Aug 12, 2025

Update

bd786cd

[ghstack-poisoned]

GregoryComer added a commit that referenced this pull request Aug 12, 2025

[Backend Tester] Seed based on test name

109d1f4

ghstack-source-id: 0a4ac57 ghstack-comment-id: 3177836622 Pull-Request: #13313

Update

a807a90

[ghstack-poisoned]

GregoryComer added a commit that referenced this pull request Aug 12, 2025

[Backend Tester] Seed based on test name

c5c1132

ghstack-source-id: a1a6fba ghstack-comment-id: 3177836622 Pull-Request: #13313

Update

692f0fa

[ghstack-poisoned]

GregoryComer added a commit that referenced this pull request Aug 12, 2025

[Backend Tester] Seed based on test name

ea92ab7

ghstack-source-id: a1a6fba ghstack-comment-id: 3177836622 Pull-Request: #13313

Update

e96c2ef

[ghstack-poisoned]

GregoryComer added a commit that referenced this pull request Aug 12, 2025

[Backend Tester] Seed based on test name

75a40b4

ghstack-source-id: abd8e0d ghstack-comment-id: 3177836622 Pull-Request: #13313

Update

8aa25c7

[ghstack-poisoned]

GregoryComer added a commit that referenced this pull request Aug 12, 2025

[Backend Tester] Seed based on test name

43af2a6

ghstack-source-id: abd8e0d ghstack-comment-id: 3177836622 Pull-Request: #13313

Update

7ba2a7f

[ghstack-poisoned]

GregoryComer added a commit that referenced this pull request Aug 12, 2025

[Backend Tester] Seed based on test name

4d677ec

ghstack-source-id: f3c1119 ghstack-comment-id: 3177836622 Pull-Request: #13313

Update

e7b7975

[ghstack-poisoned]

GregoryComer added a commit that referenced this pull request Aug 13, 2025

[Backend Tester] Seed based on test name

382a0aa

ghstack-source-id: f123042 ghstack-comment-id: 3177836622 Pull-Request: #13313

GregoryComer added a commit that referenced this pull request Aug 13, 2025

[Backend Tester] Seed based on test name

cd581b3

ghstack-source-id: f123042 ghstack-comment-id: 3177836622 Pull-Request: #13313

GregoryComer mentioned this pull request Aug 13, 2025

[Backend Tester] Add test flow CLI arg #13360

Open
